Biclustering of gene expression data by an extension of mixtures of factor analyzers.

نویسندگان

  • Francesca Martella
  • Marco Alfò
  • Maurizio Vichi
چکیده

A challenge in microarray data analysis concerns discovering local structures composed by sets of genes that show homogeneous expression patterns across subsets of conditions. We present an extension of the mixture of factor analyzers model (MFA) allowing for simultaneous clustering of genes and conditions. The proposed model is rather flexible since it models the density of high-dimensional data assuming a mixture of Gaussian distributions with a particular omponent-specific covariance structure. Specifically, a binary and row stochastic matrix representing tissue membership is used to cluster tissues (experimental conditions), whereas the traditional mixture approach is used to define the gene clustering. An alternating expectation conditional maximization (AECM) algorithm is proposed for parameter estimation; experiments on simulated and real data show the efficiency of our method as a general approach to biclustering. The Matlab code of the algorithm is available upon request from authors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mixtures of common t-factor analyzers for clustering high-dimensional microarray data

MOTIVATION Mixtures of factor analyzers enable model-based clustering to be undertaken for high-dimensional microarray data, where the number of observations n is small relative to the number of genes p. Moreover, when the number of clusters is not small, for example, where there are several different types of cancer, there may be the need to reduce further the number of parameters in the speci...

متن کامل

به کارگیری خوشه‌بندی دوبعدی با روش «زیرماتریس‌های با میانگین- درایه‌های بزرگ» در داده‌های بیان ژنی حاصل از ریزآرایه‌های DNA

Background and Objective: In recent years, DNA microarray technology has become a central tool in genomic research. Using this technology, which made it possible to simultaneously analyze expression levels for thousands of genes under different conditions, massive amounts of information will be obtained. While traditional clustering methods, such as hierarchical and K-means clustering have been...

متن کامل

Effect of different concentrations of leukemia inhibitory factor on gene expression of vascular endothelial growth factor-A in trophoblast Tumor Cell Line

Background: Several studies have shown that leukemia inhibitory factor (LIF) is one of the most important cytokinesparticipating in the process of embryo implantation and pregnancy, while, the role of this factor on vascular endothelialfactor-A (VEGF-A), as one of the most important angiogenic factor, has not been fully investigated yet. The aimof this study was to evaluate th...

متن کامل

The Effect of Aerobic Training on Tumor Necrosis Factor alpha, Hypoxia-Inducible Factor-1 alpha & Vascular Endothelial Growth Factor Gene Expression in Cardiac Tissue of Diabetic Rats

Objective: The goal of this research was to determine the influence of 4 weeks aerobic training on gene expression of tumor necrosis factor alpha (TNF-α), hypoxia-inducible factor-1 alpha (HIF-1α) and vascular endothelial growth factor (VEGF) in the cardiac tissue of diabetic rats. Materials and Methods: In an experimental study, 30 male wistar rats were partitioned into three groups (n=10), d...

متن کامل

Biclustering Gene Expressions Using Factor Graphs and the Max-Sum Algorithm

Biclustering is an intrinsically challenging and highly complex problem, particularly studied in the biology field, where the goal is to simultaneously cluster genes and samples of an expression data matrix. In this paper we present a novel approach to gene expression biclustering by providing a binary Factor Graph formulation to such problem. In more detail, we reformulate biclustering as a se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • The international journal of biostatistics

دوره 4 1  شماره 

صفحات  -

تاریخ انتشار 2008